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This invention relates to coding multichannel signals including a at least a first 
and a second signal component. More particularly, the invention relates to the coding of 
multiphonic audio signals, such as stereophonic signals. 



5 

Stereophonic audio signals comprise a left (L) and a right (R) signal 
component which may originate from a stereo signal source, for example from separated 
microphones. The coding of audio signals aims at reducing the bit rate of a stereophonic 
signal, e.g. in order to allow an efficient transmission of sound signals via a communications 
10 network, such as the Internet, via a modem and analogue telephone lines, mobile 

communication channels or other a wireless networks, etc., and to store a stereophonic sound 
signal on a chip card or another storage medium with limited storage capacity. 

US patent no. 4,589,127 discloses a transmitter for stereophonic signals which 
generates sum and difference signals of the stereo L and R signals. The sum and difference 
1 5 signals are subsequently used in a modulation scheme resulting in a modulated signal 
comprising both the sum and difference information. 

However, the above prior art method does not address the problem of 
encoding a stereo signal with an efficient bit-rate utilisation, i.e. with a low bit rate for a 
given sound quality. 

20 

The above and other problems are solved by a method of encoding a 
multichannel signal including at least a first signal component and a second signal 
component, the method comprising the steps of 
25 - transforming at least the first and second signal components by a predetermined 

transformation into a principal signal including most of the signal energy and at least one 
residual signal including less energy than the principal signal, the predetermined 
transformation being parameterised by at least one transformation parameter; and 
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- representing the multichannel signal at least by the principal signal and the transformation 
parameter. 

Consequently, by transforming the multichannel signal into a principal signal 
which includes most of the energy and a residual signal which only includes little signal 
5 energy, the multichannel signal may be represented by the principal signal, the 

transformation parameter, and optionally a small residual signal, thereby improving the 
coding efficiency for the multichannel signal. Effectively, the multichannel signal may be 
encoded with a bit rate which is only slightly higher than that of a single channel, e.g. a mono 
channel. The resulting encoded signal may be stored and/or communicated to a receiver. 

1 0 When the method further comprises the step of adaptively determining the 

transformation parameter based on at least the first and second signal components, the 
optimal transformation parameter may continuously be tracked, thereby ensuring the 
transformation remains optimal even if the characteristics of the input signal changes, e.g. in 
the example of an audio signal due to a moving sound source or changes in acoustic 

1 5 properties of the environment. 

When the predetermined transformation is a rotation and the transformation 
parameter corresponds to an angle of rotation, a simple transformation is provided based only 
on a single parameter, the angle of rotation. By adapting the angle such that the signal 
components, e.g. the L and R signal components of a stereo signal, are rotated into a principal 

20 component signal and a residual signal, an efficient coding is provided while maintaining a 
high quality signal. 

In a preferred embodiment of the invention, the step of representing the 
multichannel signal at least by the principal signal and the transformation parameter further 
comprises the step of representing the multichannel signal by the principal signal, the 

25 transformation parameter, and the residual signal, thereby further improving the quality of the 
encoded signal as no signal information is discarded. As the residual signal is small 
compared to the principal signal, the bit allocation may be traded between those signals. 
Furthermore, as the bit rate allocation may be varied, a mechanism for graceful degradation is 
provided, e.g. by adaptively increasing or decreasing the bit rate allowed for the residual 

30 signal. 

Consequently, in a further preferred embodiment, the step of representing the 
multichannel signal by the principal signal, the transformation parameter, and the residual 
signal further comprises the steps of 
- encoding the principal signal with a first bit rate; and 
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- encoding the residual signal with a second bit rate smaller than the first bit rate. 

In another preferred embodiment, 

- the method further comprises the step of estimating the residual signal from the principal 
signal using a prediction filter corresponding to a set of filter parameters; and 

5 - the step of representing the multichannel signal at least by the principal signal and the 
transformation parameter comprises the step of representing the multichannel signal by 
the principal signal, the transformation parameter, and the set of filter parameters. 

This embodiment of the invention is based on the recognition that for many 
multichannel signals, e.g. in the case of audio signals for music and speech signals, the 
10 residual signal may be estimated as a filtered version of the principal signal. Hence, when 
determining a set of filter parameters of an adaptive filter which models the residual signal, 
the filter parameters may be encoded together with the principal signal and the 
transformation parameter. Consequently, communicating the residual signal is avoided 
without loosing the information included in that signal, thereby providing an efficient 
1 5 encoding which preserves a high level of quality. 

It is an advantage of the invention that it provides an efficient bit-rate 
utilisation, i.e. a coding scheme which uses a low bit rate for a given sound quality. The 
coding scheme according to the invention may be used to reduce the bit rate without 
significantly reducing the sound quality, to maintain the bit rate while improving the sound 
20 quality, or a combination of the above. 

The invention further relates to a method of decoding multichannel signal 
information, the method comprising the steps of 

- receiving a principal signal and a transformation parameter, the principal signal 
corresponding to a result of a predetermined transformation of at least a first and a second 

25 signal component of a multichannel source signal, the predetermined transformation 

being parameterised by at least the transformation parameter; and 

- generating a first and a second decoded signal component by inversely transforming the 
received principal signal and at least one residual signal. 

The present invention can be implemented in different ways including the 
30 methods described above and in the following, arrangements for encoding and decoding 
multichannel signals, respectively, a data signal, and further product means, each yielding 
one or more of the benefits and advantages described in connection with the first-mentioned 
method, and each having one or more preferred embodiments corresponding to the preferred 
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embodiments described in connection with the first-mentioned method and disclosed in the 
dependant claims. 

It is noted that the features of the methods described above and in the 
following may be implemented in software and carried out in a data processing system or 
other processing means caused by the execution of computer-executable instructions. The 
instructions may be program code means loaded in a memory, such as a RAM, from a storage 
medium or from another computer via a computer network. Alternatively, the described 
features may be implemented by hardwired circuitry instead of software or in combination 
with software. 

The invention further relates to an arrangement for encoding a multichannel 
signal including at least a first signal component and a second signal component, the 
arrangement comprising 

first processing means adapted to transform at least the first and second signal 
components by a predetermined transformation into a principal signal including most of 
the signal energy and at least one residual signal including less energy than the principal 
signal, the predetermined transformation being parameterised by at least one 
transformation parameter; and 

- second processing means adapted to represent the multichannel signal at least by the 
principal signal and the transformation parameter. 

The invention further relates to an arrangement for decoding multichannel 
signal information, the arrangement comprising 

- receiving means for receiving a principal signal and a transformation parameter, the 
principal signal corresponding to a result of a predetermined transformation of a first and 
a second multichannel source signal, the predetermined transformation being 
parameterised by at least the transformation parameter; and 

- processing means for generating a first and a second multichannel signal by inversely 
transforming the received principal signal and a residual signal. 

The above arrangements may be part of any electronic equipment including 
computers, such as stationary and portable PCs, stationary and portable radio 
communications equipment and other handheld or portable devices, such as mobile 
telephones, pagers, audio players, multimedia players, communicators, i.e. electronic 
organisers, smart phones, personal digital assistants (PDAs), handheld computers, or the like. 

The term processing means comprises general- or special-purpose 
programmable microprocessors, Digital Signal Processors (DSP), Application Specific 
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Integrated Circuits (ASIC), Programmable Logic Arrays (PLA), Field Programmable Gate 
Arrays (FPGA), special purpose electronic circuits, etc., or a combination thereof. The above 
first and second processing means may be separate processing means or they may be 
comprised in one processing means. 
5 The term receiving means includes circuitry and/or devices suitable for 

enabling the communication of data, e.g. via a wired or a wireless data link. Examples of 
such receiving means include a network interface, a network card, a radio receiver, a receiver 
for other suitable electromagnetic signals, such as infrared light, e.g. via an IrDa port, radio- 
based communications, e.g. via Bluetooth transceivers, or the like. Further examples of such 
10 receiving means include a cable modem, a telephone modem, an Integrated Services Digital 
Network (ISDN) adapter, a Digital Subscriber Line (DSL) adapter, a satellite transceiver, an 
Ethernet adapter, or the like. 

The term receiving means further comprises other input circuits/devices for 
receiving data signals, e.g. data signals stored on a computer-readable medium. Examples of 
15 such receiving means include a floppy-disk drive, a CD-Rom drive, a DVD drive, or any 
other suitable disc drive, a memory card adapter, a smart card adapter, etc. 

The invention further relates to a data signal including multichannel signal 
information, the data signal being generated by a method described above and in the 
following. The signal may be embodied as a data signal on a carrier wave, e.g. as a data 
20 signal transmitted by communications means as described above and in the following. 

The invention further relates to a computer-readable medium comprising a 
data record indicative of multichannel signal information generated by a method described 
above and in the following. The term computer-readable medium comprises magnetic tape, 
optical disc, digital video disk (DVD), compact disc (CD or CD-ROM), mini-disc, hard disk, 
25 floppy disk, ferro-electric memory, electrically erasable programmable read only memory 
(EEPROM), flash memory, EPROM, read only memory (ROM), static random access 
memory (SRAM), dynamic random access memory (DRAM), synchronous dynamic random 
access memory (SDRAM), ferromagnetic memory, optical storage, charge coupled devices, 
smart cards, PCMCIA card, etc. 
30 The invention further relates to a device for communicating a multichannel 

signal including at least a first signal component and a second signal component, the device 
comprising an arrangement for encoding the multichannel signal as described above and in 
the following. 
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These and other aspects of the invention will be apparent from and elucidated 
with reference to the embodiments and with reference to the drawing, in which: 

fig. 1 shows a schematic view of a system for communicating stereo signals 
5 according to an embodiment of the invention; 

fig. 2 shows a schematic view of an arrangement for encoding a stereo signal 
according to a first embodiment of the invention; 

fig. 3 illustrates the determination of the signal transformation according to an 
embodiment of the invention; 
10 fig. 4 shows a schematic view of an arrangement for decoding a stereo signal 

according to a first embodiment of the invention; 

fig. 5 shows a schematic view of an arrangement for encoding a stereo signal 
according to a second embodiment of the invention; 

fig. 6 shows a schematic view of an arrangement for decoding a stereo signal 
1 5 according to a second embodiment of the invention; 

figs. 7a-c show schematic views of examples of a filter circuit for use in an 
embodiment of the invention; 

fig. 8 shows a schematic view of an arrangement for encoding a stereo signal 
according to a third embodiment of the invention; 
20 fig. 9 shows a schematic view of an arrangement for encoding a stereo signal 

according to a fourth embodiment of the invention; 

fig. 10 shows a schematic view of an arrangement for decoding a stereo signal 
according to the fourth embodiment of the invention; 

fig. 1 1 shows a schematic view of an arrangement for encoding a multichannel 
25 signal according to a fifth embodiment of the invention; 

fig. 12 shows a schematic view of an arrangement for encoding a multichannel 
signal according to a sixth embodiment of the invention; and 

fig. 13 shows a schematic view of a subtraction circuit for use with an 
embodiment of the invention. 

30 

Fig. 1 shows a schematic view of a system for communicating stereo signals 
according to an embodiment of the invention. The system comprises a coding device 101 for 
generating a coded stereophonic signal and a decoding device 105 for decoding a received 
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coded signal into a stereo L signal and a stereo R signal component. The coding device 101 
and the decoding device 105 each may be any electronic equipment or part of such 
equipment. Here the term electronic equipment comprises computers, such as stationary and 
portable PCs, stationary and portable radio communication equipment and other handheld or 

5 portable devices, such as mobile telephones, pagers, audio players, multimedia players, 
communicators, i.e. electronic organisers, smart phones, personal digital assistants (PDAs), 
handheld computers, or the like. It is noted that the coding device 101 and the decoding 
device may be combined in one electronic equipment where stereophonic signals are stored 
on a computer-readable medium for later reproduction. 

10 The coding device 101 comprises an encoder 102 for encoding a stereophonic 

signal according to the invention, the stereophonic signal including an L signal component 
and an R signal component. The encoder receives the L and R signal components and 
generates a coded signal T. The stereophonic signal L and R, may originate from a set of 
microphones, e.g. via further electronic equipment, such as a mixing equipment, etc. The 

15 signals may further be received as an output from another stereo player, over-the-air as a 
radio signal, or by any other suitable means. Preferred embodiments of such an encoder 
according to the invention will be described below. According to one embodiment, the 
encoder 102 is connected to a transmitter 103 for transmitting the coded signal T via a 
communications channel 109 to the decoding device 105. The transmitter 103 may comprise 

20 circuitry suitable for enabling the communication of data, e.g. via a wired or a wireless data 
link 109. Examples of such a transmitter include a network interface, a network card, a radio 
transmitter, a transmitter for other suitable electromagnetic signals, such as an LED for 
transmitting infrared light, e.g. via an IrDa port, radio-based communications, e.g. via a 
Bluetooth transceiver, or the like. Further examples of suitable transmitters include a cable 

25 modem, a telephone modem, an Integrated Services Digital Network (ISDN) adapter, a 

Digital Subscriber Line (DSL) adapter, a satellite transceiver, an Ethernet adapter, or the like. 
Correspondingly, the communications channel 109 maybe any suitable wired or wireless 
data link, for example of a packet-based communications network, such as the Internet or 
another TCP/IP network, a short-range communications link, such as an infrared link, a 

30 Bluetooth connection or another radio-based link. Further examples of the communications 
channel include computer networks and wireless telecommunications networks, such as a 
Cellular Digital Packet Data (CDPD) network, a Global System for Mobile (GSM) network, 
a Code Division Multiple Access (CDMA) network, a Time Division Multiple Access 
Network (TDMA), a General Packet Radio service (GPRS) network, a Third Generation 
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network, such as a UMTS network, or the like. Alternatively or additionally, the coding 
device may comprise one or more other interfaces 104 for communicating the coded stereo 
signal T to the decoding device 105. Examples of such interfaces include a disc drive for 
storing data on a computer-readable medium 110, e.g. a floppy-disk drive, a read/write CD- 
ROM drive, a DVD-drive, etc. Other examples include a memory card slot a magnetic card 
reader/writer, an interface for accessing a smart card, etc. Correspondingly, the decoding 
device 105 comprises a corresponding receiver 108 for receiving the signal transmitted by the 
transmitter and/or another interface 106 for receiving the coded stereo signal communicated 
via the interface 104 and the computer-readable medium 110. The decoding device further 
comprises a decoder 107 which receives the received signal T and decodes it into 
corresponding stereo components L' and R\ Preferred embodiments of such a decoder 
according to the invention will be described below. The decoded signals L' and R' may 
subsequently be fed into a stereo player for reproduction via a set of speakers, head-phones, 
or the like. 

Fig. 2 shows a schematic view of an arrangement 102 for encoding a stereo 
signal according to a first embodiment of the invention. The arrangement comprises circuitry 
201 for performing a rotation of the stereo signal in the L-R space by an angle a, resulting in 
rotated signal components y and r according to the transformation 

y = L cos a + R sin a = w L L + w R R 

r = -L sin a + R cos a = -w R L + w L R, (1) 

where wl=coscx and w R =sincc will be referred to as weighting factors. 

According to the invention, the angle a is determined such that it corresponds 
to a direction of high signal variance. The direction of maximum signal variance, i.e. the 
principal component, may be estimated by a principal component analysis such that the 
rotated y component corresponds to the principal component signal which includes most of 
the signal energy, and r is a residual signal. Correspondingly, the arrangement of fig. 2 
comprises circuitry 200 which determines the angle a or, alternatively, the weight factors w L 
and wr. 

Referring to fig. 3, according to a preferred embodiment, the above weight 
factors w L and w R are determined according the following algorithm: 
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Initially, the incoming stereo signals L and R are rectified and lowpass 
filtered, resulting in envelope signals p(k) of L and q(k) of R, respectively, where p(k) and 
q(k) are suitably sampled and the sample index is denoted k. Thus, the vector x(k) = (p(k), 
q(k)) denotes the incoming signal vector. Alternatively, the signals L and R may be used 

5 directly, i.e. without filtering, or other filtered versions of L and R may be used, e.g. highpass 
filtered signals L and R. In fig. 3 a number of signal points are illustrated as circles. As an 
example, the signal point x(k) and its corresponding components p(k) and q(k) are indicated. 
According to the invention, the signals are rotated in the direction of the principal component 
of the signal vectors. In the example of fig. 3, this corresponds to the y direction where a is 

10 the angle between the y direction and the p direction. The weight vector w = (w L , wr) 

indicates the direction of the principal component, and the rotated components of x(k) are 
denoted y(k) and r(k), respectively. 

The principal component may be determined by any suitable method known in 
the art. In a particularly advantageous embodiment, an iterative method utilising Oja's rule 

15 (see e.g. S. Haykin: "Neural Networks", Prentice Hall, N.J., 1999) is used. According to this 
embodiment, the weight vector w is iteratively estimated according to the following equation 

w(k) = w(k-l) + fx [x(k-l) - w(k-l) y(k-l)], (2) 

20 where w(k) = ( w L (k), w R (k) ) corresponds to the estimate at time k. The above iteration may, 
for example, be initiated with a set of small random weights w(0), or in any other suitable 
way. The above estimated weight vector may be used to calculate the rotated signal 
according to y(k) = w T (k)x(k). Alternatively, the iteration of eqn. (2) may be performed on a 
block basis, e.g. for a block of N samples, where N depends on the particular implementation, 

25 for example, N=512, 1024, 2048, etc. In this embodiment, the estimated weight vector w(N) 
for a block may be used in the transformation of all samples of that block according to y(k) = 
w T (N)x(k). 

The factor \x in eqn. (2) corresponds to a time scale of the tracking algorithm. 
If n=0, the weighting factors and, thus, the angle a, remain constant, while they change 
30 rapidly for large fx. As an example, for a block size of 2048 samples, \i may be selected of the 
order of 10" 3 for a sampling rate of 44.1 kHz. 

It is an advantage of the above iterative algorithm that it is linear, i.e. it does 
not require the calculation of any trigonometric functions, square roots or the like. It is a 
further advantage, that the above iteration yields a normalised weight vector w, as the term - 
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^iw(k-l)y(k-l) in eqn. (2) corresponds to a weight decay term penalising large weights while 
the term + ji x(k-l) drives the weight vector in the direction of the principal component. It is 
further noted that in the present embodiment, since x(k) is the envelope signal, Wl,wr e 
[0,1], i.e. the weight vector w lies in the first quadrant in fig. 3, thereby ensuring that \i is 
positive. It is a further advantage of this embodiment that it suffices to transmit one of w L and 

wr, as the other factor may be determined according to w R = ^/l-(w L ) 2 . Alternatively, the 

angle a may be transmitted. 

Again referring to fig. 2, the circuit 200 outputs the determined angle a or, 
alternatively, one or both of the weight factors w L and w R . The angle information is fed into 
the rotation circuit 201 which generates the rotated signal components y and r. It is 
understood that the circuits 200 and 201 may be combined in a single circuit performing the 
iterative calculation of eqn. (2) and the calculation of y and r according to eqn. (1). 

The arrangement further comprises encoders 202 performing a proper 
encoding of the signals y and r, respectively. For example, the signals may be encoded 
according to MPEG, e.g. MPEG I layer 3 (MP3) , according to sinusoidal coding (SSC), or 
audio coding schemes based on subband, parametric, or transform schemes, or any other 
suitable schemes or combination thereof. It is understood that the encoders 202 may be of the 
same type or of different types, e.g. one MP3 encoder and one SSC encoder, etc. The 
resulting coded signals y e and r e , respectively, are fed into a combiner circuit 204 together 
with the angle information a. The combiner circuit 204 performs framing, bit-rate allocation, 
and lossless coding, resulting in a combined signal T to be communicated. In one 
embodiment, the angle a or, alternatively, w L and/or w R may be communicated as part of a 
header transmitted prior to a signal frame, a signal block, or the like. According to the 
invention, as the transformation angle a is tracked such that the principal component signal 
includes most of the signal energy, the bit rates allocated to the y and r signals may be 
selected to be different, thereby optimising the coding efficiency. 

For example, the stereo signals L and R may be expressed as L=M+S and 
R=M-S, where M corresponds to a mid or centre signal and S corresponds to a stereo or side 
signal. In the case of an acoustic recording of a stationary sound source, e.g. a speaker 
recorded by two microphones, the L and R signals are substantially equal, if the speaker is 
positioned exactly between the microphones and assuming that there are no acoustic 
distortions such as reflections, etc. Hence, in this case S is substantially zero or at least small 
and the coding scheme according to the invention substantially yields y corresponding to 
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L+R and r corresponding to L-R being zero or small; this corresponds to a = 45 degrees. If 
the speaker is not positioned exactly between the microphones, i.e. there is an asymmetry, but 
still assuming that there are no reflections or other distortions, the rotated signal y according 
to the invention still corresponds to the speaker and the residual signal r is substantially zero. 
However, in this case the angle a differs from 45 degrees. If the sound source moves, e.g. 
from left to right, the method according to the invention still yields a principal component 
signal y corresponding to the source and a small residual signal r, ideally being r=0. In this 
case, a changes form 0 (fully left) to 90 degrees (fidly right). 

The above example illustrates the advantage of tracking the angle a. In the 
above situations it may suffice to transmit the principal component signal y and the angle a, 
in order to allow the decoder to reconstruct the original signals R and L without a significant 
loss in quality. In general, as the residual signal r is small compared to the principal signal, 
the bit allocation or coding efficiency is tradable between y and r. Hence, it is an advantage 
of the invention that it allows an efficient coding of stereo signals. 

Fig. 4 shows a schematic view of an arrangement 107 for decoding a stereo 
signal according to the first embodiment of the invention. The arrangement receives a coded 
stereo signal T, for example originating from an encoder according to the embodiment 
described in connection with fig. 2. The arrangement comprises a circuit 404 for extracting 
the encoded signals y e and r e and the angle information a from the combined signal T, i.e. the 
circuit 404 performs an inverse operation of the combiner 204 of fig. 2. The extracted signals 
y e and r e are fed into corresponding decoders 402 performing audio decoding corresponding 
to the encoding performed by the encoders 202 of fig. 2, resulting in the decoded principal 
component signal y* and the decoded residual signal r\ The signals y', r' and the angle 
information a are fed into a rotation circuit 401 which rotates the signals y', r' back in the 
direction of the original L and R components, thus resulting in the received signals L' and 
R\ 

Fig. 5 shows a schematic view of an arrangement for encoding a stereo signal 
according to a second embodiment of the invention. The arrangement comprises circuitry 201 
performing a rotation of the stereo signals L and R by an angle a, resulting in rotated signal 
components y and r, as described in connection with fig. 2. The arrangement further 
comprises circuitry 200 for determining the angle of rotation as described in connection with 
figs. 2 and 3. According to this embodiment of the invention, it is recognised that the residual 
signal r may be estimated as a filtered version of the principal signal y. As described in 
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connection with fig. 2, in an acoustic recording of an audio source recorded by two 
microphones in the absence of acoustic distortions, e.g. due to reflections, etc., the principal 
signal y corresponds to the audio source and the residual signal is substantially zero. 
However, in a more realistic situation distortions are present, e.g. due to reflections of the 
5 signal at the walls of a room and at the head and torso of the speaker, etc. These effects 

influence the residual signal r. Consequently, when estimating the residual signal by a filter, 
the filter in effect models the room acoustics, etc. For a classical orchestra the situation is 
similar, while in the case of modern pop music the situation may be slightly different. In this 
case, a sound engineer typically mixes multiple channels into two channels, often using 
10 artificial reverberation, effect boxes etc. In this case the filter models the acoustic effects 
introduced by the mixing process. 

Still referring to fig. 5, the arrangement comprises an adaptive filter 501 
receiving the principal signal y as an input and generating a filtered signal? . The filter 
parameters F p of the adaptive filter are selected such that the filtered signal f approximates 
15 the residual signal r, e.g. by controlling the adaptive filter 501 by the error signal e indicating 
the difference between r and f as generated by a subtraction circuit 502. The resulting filter 
parameters F p are fed into a combiner circuit 204, preferably after a proper encoding by an 
encoder 503, e.g. an encoder providing a Huffman encoding or any other suitable coding 
scheme. The filter 501 may be any suitable filter known in the art. Examples of such filters 
20 include a finite impulse response (FIR) filter or a infinite impulse response (IIR) filter, 
adaptive or fixed, with the cut-off frequencies and magnitudes being fixed or tracked 
recursively, or the like. The filter may be of any order, preferably smaller than 10. The type 
of the filter can be Butterworth, Chebychev, or any other suitable type of filter. The 
arrangement further comprises an encoder 202 for encoding the principal signal as described 
25 in connection with fig. 2, resulting in the encoded principal signal y e which is fed into the 
combiner circuit 204 together with the filter parameters F p and the angle information a. As 
described in connection with fig. 2, the combiner circuit 204 performs framing, bit-rate 
allocation, and lossless coding, resulting in a combined signal T to be communicated which 
includes the encoded principal signal y e , the filter parameters F p and the angle information a. 
30 According to this embodiment of the invention, the bit rate to be allocated to the filter 

parameters F p may be considerably smaller than the bit rate necessary for the principal signal 
y, e.g. in one embodiment, the bit-rate for F p may, on average, be less than 10% of the bit rate 
for y. Hence, it is an advantage of the invention that it reduces the bit rate necessary for 
transmitting a stereo signal. The total bit rate according to the invention is only slightly 
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higher than for a single mono channel. It is noted, however, that this ratio may vary during a 
recording. For example, the ratio may become smaller, e.g. in a situation with little 
distortions and a stationary source, but also larger, e.g. if the L and R signals are momentarily 
independent. 

Fig. 6 shows a schematic view of an arrangement 107 for decoding a stereo 
signal according to the second embodiment of the invention. The arrangement receives a 
coded stereo signal T, for example originating from an encoder according to the embodiment 
described in connection with fig. 5. The arrangement comprises a circuit 404 for extracting 
the encoded signal y e , the filter parameters F Pf and the angle information a from the 
combined signal T, i.e. the circuit 404 performs an inverse operation of the combiner 204 of 
fig. 5. The extracted signal y e is fed into a decoder 402 for performing audio decoding 
corresponding to the encoding performed by the encoder 202 of fig. 5, resulting in the 
decoded principal component signal y\ Preferably, the filter parameters are decoded by a 
decoder 602 corresponding to the encoding of the filter parameters by the encoder 503 of fig. 
5. The signal y* is fed into a filter 601 together with the received filter parameters F p . The 
filter 601 generates a corresponding estimated residual signal f 1 . The received principal 
component signal y\ the estimated residual signal f and the received angle information a 
are fed into a rotation circuit 401 which rotates the signals y\ f ' back in the direction of the 
original L and R components, thus resulting in the received signals L' and R\ 

In the embodiment described in connection with figs. 5 and 6, the filters 501 
and 601 may be a standard adaptive filter in the temporal or time domain (see e.g. "Adaptive 
Filter Theory", by S. Haykin, Prentice Hall, 2001), e.g. an adaptive filter known from the 
field of echo cancellation. Other examples of filters include a fixed FIR or IIR filter with a 
fixed or adaptive cut-off-frequency and magnitude. Alternatively, in one embodiment, the 
filter may be based on a psychoacoustic model of the human auditory system, e.g. as is 
known from MPEG coding, thereby reducing the number of filter parameters. According to 
yet another embodiment the filter is further simplified, e.g. by using a 10 th order filter using 5 
BiQuadratic filters and an artificial reverberation unit. In this embodiment, at the encoding 
side, the filter is fitted and the reverberation time is determined. These parameters are varying 
slowly, thereby reducing the necessary bit rate for their transmission. 

Figs. 7a-c show schematic views of examples of a filter circuit for use in an 
embodiment of the invention. 

In the example of fig. 7a, the filter 501 comprises a combination of a filter 701 
and a reverberation filter 702. For example, the filter 701 may be a standard adaptive filter in 
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the temporal or time domain, a fixed FIR or mt filter with a fixed or adaptive cut-off- 
frequency and magnitude, etc., e.g. a high-pass filter. According to this embodiment, both the 
filter parameters of the filter 701 and the parameters of the reverberation filter 702, such as 
the reverberation time denoted T 6 o, are transmitted to the decoder as filter parameters F p . 

In the example of fig. 7b, in addition to the filters 701 and 702, two control 
circuits 703-704 are added. A control circuit 703 is added to ensure that the average power of 
the residual signal r and the average power of the output of the reverberator 702 are 
approximately the same, e.g. by multiplying the output of the reverberator 702 with a 
parameter pi. A second control circuit 704 multiplies the scaled output of the reverberator 
with p 2 . The factor p 2 may be selected in the range between -3dB and + 6dB and it is 
determined such that the cross correlation p between r and P is as high as possible, i.e. that 
the signals r and P are as similar as possible. Hence, the filter arrangement of fig. 7b further 
comprises a circuit 705 for determining the cross correlation p. The filter arrangement further 
comprises a multiplier 706 for generating the product p=Pr p 2 which is output as a part of the 
filter parameters F p . Hence, pi is a gain that is automatically controlled, e.g. by comparing 
the absolute mean of r and P , and p 2 is another gain that is automatically controlled, e.g. by 
use of the cross-correlation coefficient p. The first gain is intended to make sure that the 
energy of r is preserved, i.e. that the energy of the predicted signal P' at the receiver 
corresponds to the energy of r. The second gain is to make sure that r and P' are well 
correlated. 

In one embodiment, the reverberator 702 and the filter 701 may be fixed, i.e. 
not adapted according to the filter parameters F p . Further, p 2 maybe fixed, thereby leaving 
the slowly varying parameter pi as the only adaptive parameter which needs to be adjusted 
and transmitted. Consequently, a particularly simple filter arrangement is provided. It is an 
advantage of this embodiment that it only requires about half the original stereo bit rate for 
transmitting a stereo signal. It is noted that further variations of the above embodiment may 
be used. For example, in one embodiment the filter 701 may be left out. 

Furthermore, alternatively or additionally to the correlation p, other measures 
of correlation maybe used to ensure a high degree of similarity between the original signal 
and the signal after encoding-decoding. For example, in one embodiment two correlators may 
be used instead of correlator 705. One correlator may compute the cross-correlation plr of 
the input signals L and R. Furthermore, a second correlator may compute the cross 
correlation p'lr of the resulting outputs L 1 and R' of the encoder-decoder, i.e. according to 
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this embodiment, the encoder further comprises a decoder circuit for determining the signals 
L 1 and R f . This embodiment uses the difference 6 p =plr - p'lr to control p 2 such that s p is 
minimal. This is illustrated in fig. 7c, where the correlator of fig. 7b is replaced by circuit 707 
which receives the signals L and R as well as L' and R' as inputs and generates as an output a 
5 signal indicative of the difference s p . The output s p of circuit 707 controls circuit 704 to scale 
the estimated residual f such that s p is minimised. In one embodiment, the inputs to circuit 
707 are high-pass filtered, e.g. at 250Hz, such that the low frequencies have a decreasing 
contribution to e p . As in the embodiment of fig. 7b, it is an advantage of this embodiment that 
the correlation between the resulting stereo image and the original stereo image before the 
1 0 coding-decoding is very high. 

Fig. 8 shows a schematic view of an arrangement for encoding a stereo signal 
according to a third embodiment of the invention. The arrangement is a variation of the 
embodiment described in connection with fig. 5, and it comprises circuitry 201 for 
performing a rotation of the stereo signals L and R, circuitry 200 for determining the angle of 
15 rotation, an adaptive filter 501, a subtraction circuit 502, an encoder 202, an encoder 503, and 
a combiner circuit 204, as described in connection with fig. 5. According to this embodiment, 
the principal component signal y is not directly fed into the filter 501. Instead, the 
arrangement further comprises a decoder 402 as described in connection with fig. 6. The 
decoder 402 receives the encoded principal component signal y e generated by the encoder 
20 202 and generates the decoded principal signal y* which is fed into the filter 501. It is an 
advantage of this embodiment that it reduces the effect of coding errors introduced by the 
coding and decoding of the signal y. These coding errors cause the decoded signal y 5 to be 
slightly different from the original signal y due to the fact that the decoder 402 in practice is 
not a perfect inverse of the encoder 202, i.e. E E" 1 * 1. Consequently, by applying an 
25 encoding and decoding of the signal y at the decoder, the input y* to the filter 501 

corresponds to the input y' fed into the filter 601 at the receiver, thereby improving the result 
of the prediction of P of the residual signal at the receiver. Hence, the encoder according to 
this embodiment may be used in connection with a decoder according to the embodiment of 
fig. 6. 

30 Fig. 9 shows a schematic view of an arrangement for encoding a stereo signal 

according to a fourth embodiment of the invention. The arrangement is a variation of the 
embodiment described in connection with fig. 5, and it comprises circuitry 201 for 
performing a rotation of the stereo signals L and R, circuitry 200 for detenniiiing the angle of 
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rotation, an adaptive filter 501, a subtraction circuit 502, an encoder 202, an encoder 503, and 
a combiner circuit 204, as described in connection with fig. 5. According to this embodiment, 
the principal component signal y is not directly fed into the filter 501. Instead, the 
arrangement further comprises a multiplication circuit 901 multiplying the residual signal r 
received from circuit 201 with a constant y, and an adding circuit 902 for adding the scaled 
residual signal to the principal component signal y, resulting in a signal y + y r which is fed 
into the filter 501 . Here, y is a small positive value, e.g. of the order of 10" 2 . In one 
embodiment, the constant y is tracked adaptively. It is an advantage of this embodiment that 
frequencies which are substantially not present in the spectrum of the signal y but present in 
the spectrum of r may be utilised in the modelling of the residual signal P by the filter 501, 
thereby improving the quality of the coded signal. According to this embodiment the signal y 
+ y r is fed into the encoder 202 which generates the decoded principal signal y e to be 
transmitted to the receiver. Furthermore, according to this embodiment, the constant y is fed 
into the combiner 204 and transmitted to the receiver. 

Fig. 10 shows a schematic view of an arrangement for decoding a stereo signal 
according to the fourth embodiment of the invention, i.e. suitable for decoding a signal 
received from an encoder according to fig. 9. The arrangement comprises a circuit 404 for 
extracting the received information from the combined signal T, a decoder 402, a decoder 
602, a filter 601, and a rotation circuit 401 as described in connection with fig. 6. According 
to this embodiment, the circuit 404 further extracts the constant y from the combined signal 
T, and the arrangement further comprises a multiplication circuit 1001 for multiplying the 
predicted residual signal f generated by the filter 601 with the received constant y. The 
arrangement further comprises a circuit 1002 for subtracting the resulting scaled predicted 
residual signal y F from the decoded principal signal y\ 

Fig. 11 shows a schematic view of an arrangement for encoding a multi- 
channel signal according to a fifth embodiment of the invention. The arrangement receives a 
multichannel signal x = (xi,. . .,x n ) comprising n channels. The arrangement comprises a 
principal component analyser 1100 for performing a principal component analysis of the 
signal x, resulting in a weight vector w = (wi,. . .,w„) for transforming the input signal x into a 
principal component signal y and n-1 residual signals n, r 2 , . . ., r n -i. The arrangement further 
comprises a transformation circuit 1 101 receiving the input signal x and the determined 
weight vector w and generating the signals y and ri, . . ., r„.i according to the above 
transformation. The transformed signals are encoded by suitable encoders 202 and 1 102 and 
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combined by a combiner circuit 204 together with the weight vector w, as described in 
connection with fig. 2. According to this embodiment, the encoder 1 102 is adapted to encode 
the residual signals n, . • r n -i. For example, the encoder 1 102 may comprise n-1 parallel 
encoders, each encoding one of the residual signals as described in connection with encoder 
202. 

Fig. 12 shows a schematic view of an arrangement for encoding a multi- 
channel signal according to a sixth embodiment of the invention. In addition to the 
transformation of a multichannel signal as described in connection with fig. 1 1 , according to 
this embodiment, the principal component signal is fed into a set of adaptive filters 501, each 
predicting one of the residual signals n,. . .,r n -i, as described in connection with fig. 5, 
resulting in corresponding filter parameters F p i,. . ., F p ( n -i) which are fed into corresponding 
encoders 503 and, subsequently, into the combiner 204. At a corresponding decoder (not 
shown), corresponding filters are used for generating estimates t\ ,. . ., f f n _i of the residual 
signals based on the filter parameters, as described in connection with fig. 6. 

It is understood that, according to one embodiment, only a subset of residual 
signals, e.g. ri,. . .,r k , k<n-l, may be transmitted to the receiver or fed into corresponding 
filters, thereby reducing the necessary bit rate while maintaining most of the signal quality. 

Fig. 13 shows a schematic view of a subtraction circuit for use with an 
embodiment of the invention. In the above embodiments of figs. 5, 8, 9, and 12, the filter 
parameters are determined by comparing a target signal with an estimated signal, i.e. by the 
error signal e indicating the difference between r and P as generated by a subtraction circuit 
502. It is understood that the subtraction circuit may generate different measures of 
difference between r and f , for example a difference may be determined in the time domain 
or in the frequency domain. Referring to fig, 13, the circuit 502 may comprise circuits 1301 
for transforming the signals r and P, respectively, into the frequency domain, e.g. by 
performing a fast Fourier transformation (FFT). The resulting frequency components may be 
further processed by respective circuits 1304. For example different frequencies may be 
weighted differently, preferably according to the properties of the human auditory system, 
thereby weighting differences in the audible frequency range more strongly. Other examples 
of further processing by the circuits 1304 include an averaging over predetermined frequency 
components, calculating the magnitude of the complex frequency components, clustering of 
filter components, or the like. For example, in a preferred embodiment, a clustering is 
performed prior to the subtraction in the frequency domain. This clustering may be 
performed using a filter-bank, e.g. with linear or logarithmic sub-bandwidths. Alternatively, 
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the clustering may be performed using the so-called equivalent rectangular bandwidth (ERB) 
(see e.g. "An introduction to the Psychology of Hearing" by Brian Moore, Academic Press, 
London, 1997). The equivalent rectangular bandwidth technique clusters frequency-bands 
that correspond to the human auditory filters, e.g. the so-called critical bands. According to 
this embodiment, the corresponding value of the ERB as a function of centre frequency, f (in 
kHz), is may be calculated according to ERB = 24.7( 4.37 f + 1 ). Still referring to fig. 13, the 
circuit 502 further comprises a subtraction circuit 1303 for subtracting the processed 
frequency components. Alternatively, the transformed signals generated by the circuits 1301 
are directly fed into the subtraction circuit 1304 without further processing. The difference 
signal generated by the subtraction circuit 1304 is fed into a transformation circuit 1302 for 
transforming the error signal back into the time domain, e.g. by performing an inverse fast 
Fourier transform (IFFT). Alternatively, the difference signal in the frequency domain may 
be used directly. 

It is understood that a skilled person may adapt the above embodiments, e.g. 
by adding or removing features, or by combining features of the above embodiments. For 
example, it is understood that the features introduced in embodiments of fig. 8 and 9 may be 
incorporated in the embodiment of fig. 12 as well. As another example, the error signal e 
describing the quality of the estimated residual signal in the embodiment of fig. 5 may be 
compared to a threshold error indicating a maximum acceptable error. If the error is not 
acceptable, the error signal may, after suitable coding, be transmitted together with the signal 
T similar to the methods used within the field of Linear Predictive Coding (LPC). 

It is further noted that the invention is not limited to stereophonic signals, but 
may also be applied to other multi-channel input signals having two or more input channels. 
Examples of such multi-channel signals include signals received from a Digital Versatile 
Disc (DVD) or a Super Audio Compact Disc, etc. In this more general case, a principal 
component signal y and one or more residual signals r may still be generated according to the 
invention. The number of residual signals transmitted depends on the number of channels and 
the desired bit rate, as higher order residuals may be omitted without significantly degrading 
the signal quality. 

In general, it is an advantage of the invention that bit-rate allocation may be 
adaptively varied, thereby allowing graceful degradation. For example, if the communication 
channel momentarily only allows a reduced bit rate to be transmitted, e.g. due to increased 
network traffic, noise, or the like, the bit rate of the transmitted signal may be reduced 
without significantly degrading the perceptible quality of the signal. For example, in the case 
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of a stationary sound source discussed above, the bit rate may be reduced by a factor of 
approximately two without significantly degrading the signal quality, corresponding to 
transmitting a single channel instead of two. 

It is noted that the above arrangements may be implemented as general- or 
special-purpose programmable microprocessors, Digital Signal Processors (DSP), 
Application Specific Integrated Circuits (ASIC), Programmable Logic Arrays (PLA), Field 
Programmable Gate Arrays (FPGA), special purpose electronic circuits, etc., or a 
combination thereof. 

It should be noted that the above-mentioned embodiments illustrate rather than 
limit the invention, and that those skilled in the art will be able to design many alternative 
embodiments without departing from the scope of the appended claims. In the claims, any 
reference signs placed between parentheses shall not be construed as limiting the claim. The 
word 'comprising' does not exclude the presence of other elements or steps than those listed 
in a claim. The invention can be implemented by means of hardware comprising several 
distinct elements, and by means of a suitably programmed computer. In a device claim 
enumerating several means, several of these means can be embodied by one and the same 
item of hardware. The mere fact that certain measures are recited in mutually different 
dependent claims does not indicate that a combination of these measures cannot be used to 
advantage. 



